Serveur d'exploration sur l'OCR

Attention, ce site est en cours de développement !
Attention, site généré par des moyens informatiques à partir de corpus bruts.
Les informations ne sont donc pas validées.

Open Source OCR Framework Using Mobile Devices

Identifieur interne : 000D09 ( Main/Exploration ); précédent : 000D08; suivant : 000D10

Open Source OCR Framework Using Mobile Devices

Auteurs : Steven Zhiying Zhou [Singapour] ; SYED OMER GILANI [Singapour] ; Stefan Winkler [Singapour]

Source :

RBID : Pascal:08-0426718

Descripteurs français

English descriptors

Abstract

Mobile phones have evolved from passive one-to-one communication device to powerful handheld computing device. Today most new mobile phones are capable of capturing images, recording video, and browsing internet and do much more. Exciting new social applications are emerging on mobile landscape, like, business card readers, sing detectors and translators. These applications help people quickly gather the information in digital format and interpret them without the need of carrying laptops or tablet PCs. However with all these advancements we find very few open source software available for mobile phones. For instance currently there are many open source OCR engines for desktop platform but, to our knowledge, none are available on mobile platform. Keeping this in perspective we propose a complete text detection and recognition system with speech synthesis ability, using existing desktop technology. In this work we developed a complete OCR framework with subsystems from open source desktop community. This includes a popular open source OCR engine named Tesseract for text detection & recognition and Flite speech synthesis module, for adding text-to-speech ability.


Affiliations:


Links toward previous steps (curation, corpus...)


Le document en format XML

<record>
<TEI>
<teiHeader>
<fileDesc>
<titleStmt>
<title xml:lang="en" level="a">Open Source OCR Framework Using Mobile Devices</title>
<author>
<name sortKey="Zhiying Zhou, Steven" sort="Zhiying Zhou, Steven" uniqKey="Zhiying Zhou S" first="Steven" last="Zhiying Zhou">Steven Zhiying Zhou</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Interactive Multimedia Lab, Department of Electrical and Computer Engineering National University of Singapore, 10 Kent Ridge Crescent</s1>
<s2>Singapore 117576</s2>
<s3>SGP</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Singapour</country>
<wicri:noRegion>Singapore 117576</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Syed Omer Gilani" sort="Syed Omer Gilani" uniqKey="Syed Omer Gilani" last="Syed Omer Gilani">SYED OMER GILANI</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Interactive Multimedia Lab, Department of Electrical and Computer Engineering National University of Singapore, 10 Kent Ridge Crescent</s1>
<s2>Singapore 117576</s2>
<s3>SGP</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Singapour</country>
<wicri:noRegion>Singapore 117576</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Winkler, Stefan" sort="Winkler, Stefan" uniqKey="Winkler S" first="Stefan" last="Winkler">Stefan Winkler</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Interactive Multimedia Lab, Department of Electrical and Computer Engineering National University of Singapore, 10 Kent Ridge Crescent</s1>
<s2>Singapore 117576</s2>
<s3>SGP</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Singapour</country>
<wicri:noRegion>Singapore 117576</wicri:noRegion>
</affiliation>
</author>
</titleStmt>
<publicationStmt>
<idno type="wicri:source">INIST</idno>
<idno type="inist">08-0426718</idno>
<date when="2008">2008</date>
<idno type="stanalyst">PASCAL 08-0426718 INIST</idno>
<idno type="RBID">Pascal:08-0426718</idno>
<idno type="wicri:Area/PascalFrancis/Corpus">000265</idno>
<idno type="wicri:Area/PascalFrancis/Curation">000519</idno>
<idno type="wicri:Area/PascalFrancis/Checkpoint">000231</idno>
<idno type="wicri:Area/Main/Merge">000D21</idno>
<idno type="wicri:Area/Main/Curation">000D09</idno>
<idno type="wicri:Area/Main/Exploration">000D09</idno>
</publicationStmt>
<sourceDesc>
<biblStruct>
<analytic>
<title xml:lang="en" level="a">Open Source OCR Framework Using Mobile Devices</title>
<author>
<name sortKey="Zhiying Zhou, Steven" sort="Zhiying Zhou, Steven" uniqKey="Zhiying Zhou S" first="Steven" last="Zhiying Zhou">Steven Zhiying Zhou</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Interactive Multimedia Lab, Department of Electrical and Computer Engineering National University of Singapore, 10 Kent Ridge Crescent</s1>
<s2>Singapore 117576</s2>
<s3>SGP</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Singapour</country>
<wicri:noRegion>Singapore 117576</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Syed Omer Gilani" sort="Syed Omer Gilani" uniqKey="Syed Omer Gilani" last="Syed Omer Gilani">SYED OMER GILANI</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Interactive Multimedia Lab, Department of Electrical and Computer Engineering National University of Singapore, 10 Kent Ridge Crescent</s1>
<s2>Singapore 117576</s2>
<s3>SGP</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Singapour</country>
<wicri:noRegion>Singapore 117576</wicri:noRegion>
</affiliation>
</author>
<author>
<name sortKey="Winkler, Stefan" sort="Winkler, Stefan" uniqKey="Winkler S" first="Stefan" last="Winkler">Stefan Winkler</name>
<affiliation wicri:level="1">
<inist:fA14 i1="01">
<s1>Interactive Multimedia Lab, Department of Electrical and Computer Engineering National University of Singapore, 10 Kent Ridge Crescent</s1>
<s2>Singapore 117576</s2>
<s3>SGP</s3>
<sZ>1 aut.</sZ>
<sZ>2 aut.</sZ>
<sZ>3 aut.</sZ>
</inist:fA14>
<country>Singapour</country>
<wicri:noRegion>Singapore 117576</wicri:noRegion>
</affiliation>
</author>
</analytic>
<series>
<title level="j" type="main">Proceedings electronic imaging science and technology</title>
</series>
</biblStruct>
</sourceDesc>
<seriesStmt>
<title level="j" type="main">Proceedings electronic imaging science and technology</title>
</seriesStmt>
</fileDesc>
<profileDesc>
<textClass>
<keywords scheme="KwdEn" xml:lang="en">
<term>Character recognition</term>
<term>Electronic trade</term>
<term>Information browsing</term>
<term>Internet</term>
<term>Linguistic analysis</term>
<term>Mobile phone</term>
<term>Mobile platform</term>
<term>Mobile radiocommunication</term>
<term>Optical character recognition</term>
<term>Pattern recognition</term>
<term>Portable equipment</term>
<term>Reader</term>
<term>Speech synthesis</term>
<term>Subsystem</term>
<term>System synthesis</term>
<term>Wireless telecommunication</term>
</keywords>
<keywords scheme="Pascal" xml:lang="fr">
<term>Reconnaissance optique caractère</term>
<term>Radiocommunication service mobile</term>
<term>Téléphone portable</term>
<term>Appareil portatif</term>
<term>Navigation information</term>
<term>Internet</term>
<term>Commerce électronique</term>
<term>Lecteur</term>
<term>Plateforme mobile</term>
<term>Reconnaissance caractère</term>
<term>Synthèse système</term>
<term>Synthèse parole</term>
<term>Sous système</term>
<term>Analyse linguistique</term>
<term>Reconnaissance forme</term>
<term>Télécommunication sans fil</term>
</keywords>
<keywords scheme="Wicri" type="topic" xml:lang="fr">
<term>Commerce électronique</term>
<term>Télécommunication sans fil</term>
</keywords>
</textClass>
</profileDesc>
</teiHeader>
<front>
<div type="abstract" xml:lang="en">Mobile phones have evolved from passive one-to-one communication device to powerful handheld computing device. Today most new mobile phones are capable of capturing images, recording video, and browsing internet and do much more. Exciting new social applications are emerging on mobile landscape, like, business card readers, sing detectors and translators. These applications help people quickly gather the information in digital format and interpret them without the need of carrying laptops or tablet PCs. However with all these advancements we find very few open source software available for mobile phones. For instance currently there are many open source OCR engines for desktop platform but, to our knowledge, none are available on mobile platform. Keeping this in perspective we propose a complete text detection and recognition system with speech synthesis ability, using existing desktop technology. In this work we developed a complete OCR framework with subsystems from open source desktop community. This includes a popular open source OCR engine named Tesseract for text detection & recognition and Flite speech synthesis module, for adding text-to-speech ability.</div>
</front>
</TEI>
<affiliations>
<list>
<country>
<li>Singapour</li>
</country>
</list>
<tree>
<country name="Singapour">
<noRegion>
<name sortKey="Zhiying Zhou, Steven" sort="Zhiying Zhou, Steven" uniqKey="Zhiying Zhou S" first="Steven" last="Zhiying Zhou">Steven Zhiying Zhou</name>
</noRegion>
<name sortKey="Syed Omer Gilani" sort="Syed Omer Gilani" uniqKey="Syed Omer Gilani" last="Syed Omer Gilani">SYED OMER GILANI</name>
<name sortKey="Winkler, Stefan" sort="Winkler, Stefan" uniqKey="Winkler S" first="Stefan" last="Winkler">Stefan Winkler</name>
</country>
</tree>
</affiliations>
</record>

Pour manipuler ce document sous Unix (Dilib)

EXPLOR_STEP=$WICRI_ROOT/Ticri/CIDE/explor/OcrV1/Data/Main/Exploration
HfdSelect -h $EXPLOR_STEP/biblio.hfd -nk 000D09 | SxmlIndent | more

Ou

HfdSelect -h $EXPLOR_AREA/Data/Main/Exploration/biblio.hfd -nk 000D09 | SxmlIndent | more

Pour mettre un lien sur cette page dans le réseau Wicri

{{Explor lien
   |wiki=    Ticri/CIDE
   |area=    OcrV1
   |flux=    Main
   |étape=   Exploration
   |type=    RBID
   |clé=     Pascal:08-0426718
   |texte=   Open Source OCR Framework Using Mobile Devices
}}

Wicri

This area was generated with Dilib version V0.6.32.
Data generation: Sat Nov 11 16:53:45 2017. Site generation: Mon Mar 11 23:15:16 2024